A Chaotic Antlion Optimization Algorithm for Text Feature Selection

نویسندگان

چکیده

Abstract Text classification is one of the important technologies in field text data mining. Feature selection, as a key step processing tasks, used to process high-dimensional feature sets, which directly affects final performance. At present, most widely selection methods academia are calculate importance each for through an evaluation function, and then select subsets that meet quantitative requirements turn. However, ignoring correlation between features effect their mutual combination this way may not guarantee best effect. Therefore, paper proposes chaotic antlion algorithm (CAFSA) solve problem. The main contributions include: (1) Propose (CAA) based on quasi-opposition learning mechanism chaos strategy, compare it with other four algorithms 11 benchmark functions. has achieved higher convergence speed highest optimization accuracy. (2) Study performance CAFSA using CAA when different models, including decision tree, Naive Bayes, SVM classifier. (3) compared eight three Chinese datasets. experimental results show can reduce number improve accuracy classifier, better than methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification

In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...

متن کامل

An Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification

In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...

متن کامل

An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification

The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...

متن کامل

An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification

The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...

متن کامل

Text Feature Selection using Particle Swarm Optimization Algorithm

Text Categorization (TC) has become recently an important technology in the field of organizing a huge number of documents. Feature Selection (FS) is commonly used to reduce dimensionality of text datasets with huge number of features which would be difficult to process further. In this paper we have implemented an efficient feature selection algorithm based on Particle Swarm Optimization (PSO)...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Computational Intelligence Systems

سال: 2022

ISSN: ['1875-6883', '1875-6891']

DOI: https://doi.org/10.1007/s44196-022-00094-5